Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎯 Reinforcement Learning
Q-Learning, Policy Gradients, Game Theory, Decision Making
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
82756
posts in
230.0
ms
Reinforcement
Learning from Human
Feedback
arxiv.org
·
20h
🤖
AI Research
Hybrid neural–cognitive models reveal how memory
shapes
human
reward
learning
nature.com
·
1d
🤖
AI Research
Adaptive
Neuro-Symbolic
Planning for smart agriculture
microgrid
orchestration in hybrid quantum-classical pipelines
dev.to
·
1h
·
Discuss:
DEV
📊
Quantitative Finance
Cooperative Autonomous Navigation of Legged Robots in Unstructured
Terrains
Using Hierarchical Reinforcement Learning — ## Abstract Legged robotic
plat
...
freederia.com
·
1d
🤖
AI Research
On
Computation
and
Reinforcement
Learning
arxiv.org
·
2d
🤖
AI Research
Multi-Agent Reinforcement Learning (
MARL
): Practical Guide to
Cooperative
and Competitive Learning
dev.to
·
2d
·
Discuss:
DEV
🤖
AI Research
Scientists reveal the alien logic of AI:
hyper-rational
but
stumped
by simple concepts
psypost.org
·
13h
🤖
AI Research
i10e-lab/HelloRL
: A fully modular framework to make Reinforcement Learning quick and easy
github.com
·
1d
·
Discuss:
Hacker News
💬
NLP
**Abstract:** This research proposes a novel framework for predictive behavioral modeling within autonomous agent
ecosystems
, leveraging principles of
Bayesi
...
freederia.com
·
2d
🤖
AI Research
On
Economics
of A(S)I Agents
lesswrong.com
·
15h
📊
Quantitative Finance
Your Best Thinking Is
Wasted
on the Wrong
Decisions
iankduncan.com
·
14h
·
Discuss:
Lobsters
,
Hacker News
📊
Quantitative Finance
Why
reinforcement
learning breaks at scale, and how a new method
fixes
it
techxplore.com
·
3d
🤖
AI Research
25W06
. Learning a language with the machine
z1nz0l1n.com
·
5m
💬
NLP
Your Agent Is
Slow
Because of
Inference
futureagi.com
·
1d
·
Discuss:
DEV
🤖
AI Research
Nonlinear random walks on
hypergraphs
characterized
by higher-order interactions
sciencedirect.com
·
19h
📊
Quantitative Finance
Show HN:
A2A
Protocol
– Infrastructure for an Agent-to-Agent Economy
news.ycombinator.com
·
3h
·
Discuss:
Hacker News
🌐
Distributed Systems
Building the Future with AI That
Acts
devxt.com
·
11h
·
Discuss:
Hacker News
🤖
AI Research
The
infamous
coin
toss
ergodicityeconomics.com
·
1d
📊
Quantitative Finance
Continual
learning and the post
monolith
AI era
baseten.co
·
1d
·
Discuss:
Hacker News
🤖
AI Research
The Missing
Layer
Above AI Inference
Governance
vibe.forem.com
·
6h
·
Discuss:
DEV
🤖
AI Research
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help